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Abstract 

Pull-tabbing is an evaluation approach for functional logic computations, based on a graph trans- 
formation recently proposed, which avoids making irrevocable non-deterministic choices that would 
jeopardize the completeness of computations. In contrast to other approaches with this property, it 
does not require an upfront cloning of a possibly large portion of the choice's context. We formally 
define the pull-tab transformation, characterize the class of programs for which the transformation 
is intended, extend the computations in these programs to include the transformation, and prove the 
correctness of the extended computations. 

KEYWORDS: Functional Logic Programming, Non-Determinism, Graph Rewriting, Pull-tabbing 



1 Introduction 

Functional logic programming ( j Antoy and Hanus 2010[ ) joins in a single paradigm the fea- 
tures of functional programming with those of logic programming. Logic programming 
contributes logic variables that are seamlessly integrated in functional computations by 
narrowing. The usefulness and elegance of programming with narrowing is presented in 
( | An toy and Hanus 2002||Antoy 2010} . At the semantics level free variables are equivalent 



to non-deterministic functions (Antoy and Hanus 20061, i.e., functions that for some ar 



guments may return any one of many results. Thus, at the implementation level variables 
can be replaced by non-deterministic functions when non-deterministic functions appear 
simpler, more convenient and/or more efficient to implement ( Brassel and Huch 20071 >. 
This paper focuses on a graph transformation recently proposed for the implementation 
of non-determinism of this kind. This transformation is intended to ensure the complete- 
ness of computations without cloning too eagerly a large portion of the context of a non- 
deterministic step. The hope is that steps following the transformation will create condi- 
tions that make cloning the not yet cloned portion of the context unnecessary. 

2 Motivation 



Non-determinism is certainly the most characterizing and appealing feature of functional 
logic programming. It enables encoding potentially difficult problems into relatively sim- 
pler programs. For example, consider the problem of abstracting the dependencies among 
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the elements of a set such as the functions of a program or the widgets of a graphical 
user interface. In abstractions of this kind, component parts "build" composite objects. A 
non-deterministic function, builds, defines which objects are dependent on each part. The 
syntax is Curry (IHanus 20061 >. 

builds pi = ol 

builds pi = o2 

builds p2 = ol 

A part can build many objects, e.g.: part pi builds objects ol and o2. Likewise, an object 
can be built from several parts, e.g.: object ol is built by parts pi and p2. Many-to-many 
relationships, such as that between objects and parts just sketched, are difficult to abstract 
and to manipulate in deterministic languages. However, in a functional logic setting, the 
non-deterministic function builds is straightforward to define and is sufficient for all other 
basic functions of the abstraction. 

For example, a function that non-deterministically computes a part of an object is simply 
defined by: 

is_built_by (builds x) = x (2) 



where is_built_by is defined using afunctional pattern ( Antoy and Hanus 2005 1. The set 



of all the parts of an object is computed by is_built_by ' set, the implicitly defined set 



function (Antoy and Hanus 2009 1 of is_built_by. 

The simplicity of design and ease of coding offered by functional logic languages through 
non-determinism do not come for free. The burden unloaded from the programmer is 
placed on the execution. All the alternatives of a non-deterministic choice must be ex- 
plored to some degree to ensure that no result of a computation goes missing. Doing this 
efficiently is a subject of active research. Below, we summarize the state of the art. 



3 Approaches 

There are three main approaches to the execution of non-deterministic steps in a functional 



logic program. A fourth approach, called pull-tabbing (Alqaddoumi et al. 2010 1, still un- 
derdeveloped, is the subject of this paper. Pull-tabbing offers some appealing characteris- 
tics missing from the other approaches. 

3.1 Running example 



We borrow from ( Alqaddoumi et al. 2010 1 a simple example to present the existing ap- 
proaches and understand their characteristics: 
flip 0=1 

flip 1 = (3) 
coin = ? 1 
We want to evaluate the expression 

(flip x, flip x) where x = coin (4) 

We recall that '?' is a library function, called choice, that returns either of its arguments, 
i.e., it is defined by the rules: 
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x ? _ = x rale C\ 

_ ? y = y rale C 2 

and that the where clause introduces a shared expression. Every occurrence of x in has 
the same value throughout the entire computation according to the call-time choice seman- 
tics (|HussmannT992l Lopez-Fraguas et al. 2007 1. By contrast, in (flip coin, flip coin) 



each occurrence of coin is evaluated independently of the other. Fig. Q] highlights the dif- 
ference between these two expressions when they are depicted as graphs. 

(,) (,) 

flip flip flip flip 



Fig. 1. Depiction of $4^ (left) and of (flip coin, flip coin) (right) as graphs. The 
symbol ( , ) denotes the pair constructor. 

A context is an expression with a distinguished symbol called hole denoted '[]'. If C is a 
context, C[x] is the expression obtained by replacing the hole in C with x. E.g., the expres- 
sion in (01 can be written as C[coin], in which C is (flip x, flip x) where x = []. 
The context [ ] is called empty context. An expression rooted by a node n labeled by the 
choice symbol is informally referred to as a choice and each argument of the choice sym- 
bol, or successor of n, is referred to as a choice's alternative. 



3.2 Previous approaches 

Backtracking is the most traditional approach to non-deterministic computations in func- 
tional logic programming. Evaluating a choice in some context, say C[ulv\, consists in 
selecting either alternative of the choice, e.g., u (the criterion for selecting the alternative 
is not relevant to our discussion), replacing the choice with the selected alternative, which 
gives C[u], and continuing the computation. In typical interpreters, if and when the com- 
putation of C[u] completes, the result is consumed, e.g., printed, and the user is given the 
option to either terminate the execution or compute C[v]. Backtracking is well-understood 
and relatively simple to implement. It is employed in successful languages such as Pro- 
log dlSO 19951) and in language implementations such as PAKCS dHanus 20081) and T Oy 
dCaballero and Sanchez 200"7l l. The major objection to backtracking is its incompleteness. 
If the computation of C[u] does not terminate, no result of C[v] is ever obtained. 

Copying (or cloning) is an approach that fixes the inherent incompleteness of backtrack- 
ing. Evaluating a choice in some context, say C[u?v], consists in evaluating simultaneously 
(e.g., by interleaving steps) and independently both C[u] and C[v], In typical interpreters, 
if and when the computation of either completes, the result is consumed, e.g., printed, and 
the user is given the option to either terminate the execution or continue with the computa- 
tion of the other. Copying is simpler than backtracking and it is used in some experimental 
implementations of functional logic languages (Antoy et al. 2005 ITolmach et al. 20041 ). 



The major objection to copying is the significant investment of time and memory made 
when a non-deterministic step is executed. If an alternative of a choice eventually fails, 
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cloning the context may have been largely useless. For a contrived example, notice that in 
l+(2+(. . .+ (n 'div' coin) . . .) ) an arbitrarily large context is cloned when the choice 
is evaluated, but for one alternative this context is almost immediately discarded. 

Bubbling is an approach proposed to avoid the drawbacks of backtracking and copying 
( |Antoy et al. 2006 Lopez-Fraguas et al. 2008 1. Bubbling is similar to copying, in that it 



clones a portion of the context of a choice to concurrently compute all its alternatives, but 
the portion of cloned context is typically smaller than the entire context. We recall that in a 
rooted graph g, a node d is a dominator of a node n, proper when d ^ n, iff every path from 
the root of g to n contains d. An expression C[m?u] can be seen as Ci[C2[it?u]] in which 
the root of C*2 is a dominator of the hole. A trivial case arises when C\ = [} and C*2 = C. 
Evaluating a choice in some context, say C[ulv], distinguishes whether or not C is empty. 
If C is the empty context, u and v are evaluated simultaneously and independently, as in 
copying, but there is no context to clone. Otherwise, the evaluation consists in finding C\ 
and Ci such that C[ulv] = C\[C2[ulv\\ and the root of C2 is a proper dominator of the 
choice, and evaluating Ci[C2[w]?C2[w]]. When C\ is the empty context, then bubbling is 
exactly as copying. Otherwise a smaller context, i.e., C2 instead of C, is cloned. Bub- 
bling intends to reduce cloning in hopes that some alternative of a choice will quickly fail. 



An objection to bubbling is the cost of find- 
ing a choice's immediate dominator and the 
risk of paying this cost repeatedly for the 
same choice. This cost entails traversing a 
possibly-large portion of the choice's con- 
text. Traversing the context is more effi- 
cient than cloning it, since cloning requires 
node construction in addition to the traver- 
sal, but it is still unappealing, since the 
cost of a non-deterministic step is not pre- 
dictable and it may grow with the size of an 
expression. 

3.3 Pull-Tabbing 

Pull-tabbing, which is at the core of our work, was first sketched in ( Alqaddoumi et al. 2010) > 
The name "pull-tab" originates from the metaphor of pulling the tab of a zipper. For an ex- 
pression, a choice is a tab and a choice's spine is a zipper. As the tab/choice is pulled up, 
the zipper/spine opens into two equal strands each of which has a different alternative of 
the choice at the end. 

Evaluating a choice in some context, say C[it?t>], distinguishes whether or not C is 
empty. If C is empty, u and v are evaluated simultaneously and independently, as in copy- 
ing and bubbling, without any context to clone. Otherwise, the expression to evaluate is of 
the form C[s(ulv)], for some symbol s (for ease of presentation we assume that s is unary, 
but there are no restrictions on its arity) and some context C. Pull-tabbing transforms the 
expression into C[s(u)?s(v)}. Without some caution, this transformation is unsound. 
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Fig. 2. Graph depiction of the state of the 
computation of (|4]l after a bubbling step. 
Since the dominator of the choice is the 
root, bubbling and copying are the same for 
this example. 
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Unsoundness may occur when some choice has two predecessors, as in our running 
example. The choice will be pulled up along two paths creating two pairs of strands that 
eventually must be pair- wise combined together. Some combinations will contain mutually 
exclusive alternatives, i.e., subexpressions impossible to obtain in languages such as Curry 
and TOy that adopt the call-time choice semantics. Fig. [3] presents an example of this 
situation. 



(,) (,) C) 




Fig. 3. Initial portion of the computation of ® with pull-tabbing. The choice in the second 
expression is being pulled up along the left path to the root. This computation would even- 
tually produce several results including (1,0) which mixes the left and right alternatives 
of the same choice. 



We will show that the soundness is recovered if the left and right alternative of a choice are 
not combined in the same expression. To this aim, we attach an identifier to each choice 
of an expression. We preserve this identifier when a choice is pulled up. If eventually the 
choice is reduced to either of its alternatives every other choice with the same identifier 
must be reduced to the same alternative. A very similar idea in a rather different setting is 
proposed in dBrassel and Huch 2007 ; Brassel ^Ol IK 

A pull-tab step clones a single node, a predecessor of the choice being pulled up. If the 
choice is pulled all the way up to the root of an expression, the choice's entire spine is 
cloned. But if an alternative of the choice fails before the choice reaches the root, further 
cloning of the choice's context becomes unnecessary. 



4 Formalization 
4.1 Background 

We define a term graph in the customary way dEchahed and Janodet 1 997 ), but extend the 
decorations of nodes with choice identifiers. 



Definition 1 (Expression) Let S be a signature, X a countable set of variables, M a 
countable set of nodes, fl a countable set of choice identifiers. A (rooted) graph over 
(E, A/", X ', f2) is a 5-tuple g = (Af g , C g , S g , 7£oots g , id g ) such that: 

1. J\f g C A/" is the set of nodes of g; 

2. C g : M g — > S U X is the labeling function mapping each node of g to a signature symbol 
or a variable; 

3. S g : J\f g —¥ J\f* is the successor function mapping each node of g to a possibly empty string 
of nodes of g such that if C g (n) = s, where s G S U X, and (for the following condition, 
we assume that a variable has arity zero) arity(s) = k, then there exist n\, . . . , rifc in M g 
such that S g (n) — n\ . . . nk; 
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4. 7£.oots g C J\f g is a subset of nodes of g called the roots of g; 

5. id g : Af g — > il is a partial function mapping nodes labeled by the choice symbol to a 
choice identifier; 

6. if C g (ni) S X and Lgiji^) G X and Cg{n\) = C g (n2), then n\ = ni, i.e., every variable 
of g labels one and only one node of g; and 

7. for each n S Af g , either n 6 7Zoots g or there is a path from r to n where r £ 7Zoots g , i.e., 
every node of g is reachable from some root of g. 

A graph g is called a term (graph), or more simply an expression, iflZoots g is a singleton. 

Typically we will use "expression" when talking about programs and "graph" when mak- 
ing formal claims. Choice identifiers play a role in computations. Thus, we will explic- 
itly define the id mapping only after formally defining the notion of computation. Term 
graphs can be seen, e.g., in Figs. Q]and|2] Every choice node of every graph of Fig. [3] 
would be decorated with the same choice identifier. Choice identifiers are arbitrary and 
only compared for equality. Node names are arbitrary and irrelevant to most purposes and 
are typically omitted. However, some definitions and proofs of our claims need to explicitly 
refer to some nodes of a graph. For this purpose, we adopt the linear notation for graphs 
(lEchahed andJanodet 19971 Def. 4). With this convention, the left graph of Fig. [T|is de- 
noted no : (rii :f lip 712 : coin, 77,3 :f lip 712), where the node names are the italicized 
identifiers starting with 'n'. We also make the convention that names of nodes that do not 
need to be referenced can be omitted, hence (flip 712 :coin, flip 712). The latter is 
conceptually identical to (@J. In the linear notation for graphs, infix operators are applied 
in prefix notation, e.g., see Lemma[3] This practice eases understanding the correspondence 
between a node identifier and the label of that node. 

The definition of graph rewriting (E chahed and Janodet 19971 |Plump 1999| > is more la- 
borious than, although conceptually very similar to, that of term rewriting (Ba ader and Nipkow 1998| 
IBezem et al. 2003llDershowitz and Jouannaud 1990l|Klop 1992[ >. Sections 2 and 3 of (lEchahed and Janodet 19971 > 
formalize key concepts of graph rewriting such as replacement, matching, homomorphism, 
rewrite rule, redex, and step in a form ideal for our discussion. Therefore, we adopt entirely 
these definitions, including their notations, and only discuss the manipulation of choice 
identifiers, since they are absent from ( Echahed and Janodet 1997l >. 



4.2 Programs 

We now formalize the class of rewrite systems that we consider in this paper. A program 
is a rewrite system in a class called limited overlapping inductively sequential, abbreviated 
LOIS. In LOIS systems, the rules are left-linear and constructor-based (IQ'Donnell 1985b . 
The left-hand sides of the rules are organized in a hierarchical structure called a definitional 



tree ( Antoy 1992 1 that guides the evaluation strategy ( Antoy 2005 1. In LOIS systems, there 
is a single operation whose rules' left-hand sides overlap. This operation is called choice, 
is denoted by the infix binary operation "?", and is defined by the rules of (01. 

LOIS systems have been investigated in some depth. Below we highlight informally the 
key results that justify our choice of LOIS systems. 



1 . Any LOIS system admits a complete, sound and optimal evaluation strategy ( Antoy 1997 1. 
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2. Any constructor-based conditional rewrite system is semantically equivalent to a 



LOIS system (Antoy 2001 



3. Any narrowing computation in a LOIS system is semantically equivalent to a rewrit- 
ing computation in another similar LOIS system ( Antoy and Hanus 2006 1. 

For the above reasons, LOIS systems are an ideal core language for functional logic pro- 
grams. Informally summarizing, LOIS systems are general enough to perform any func- 
tional logic computation (Antoy 2001 1 and powerful enough to compute by simple rewrit- 
ing (Antoy and Hanus 2006 1 and without wasting steps (Antoy 1 997)1. 



4.3 Computations 

In our setting, a computation of e is a sequence e = eo «> e\ e> . . . such that -=» ej+i 
is a step, i.e., is either one of two graph transformations: a rewrite, denoted by or a 
pull-tab, denoted by "h". A rewrite is the replacement in a graph of an instance of a rewrite 
rule left-hand side (the redex) with the corresponding instance of the rule right-hand side 
(the replacement). The pull-tab transformation is formally defined in the next section. In 
principle, we do not exclude choice reductions, i.e., non-deterministic steps, but in practice 
we limit them to the root of an expression. The reason is that reducing a choice makes 
an irrevocable commitment to one of its alternatives. Pull-tab steps are equivalent to non- 
deterministic steps in the sense, formally stated and proved in the next section, that they 
produce all and only the same results, but without any irrevocable commitment. 

A computation can be finite or infinite. A computation is successful or it succeeds iff 
its last element is a value, i.e., a constructor normal form. A computation is a. failure or it 
fails iff its last element is a normal form with some node labeled by an operation symbol. 
In non-deterministic programs, such as those considered in this paper, the same expression 
may have both successful computations and failures. Each expression of a computation is 
also referred to as a state of the computation. 

A strategy determines which step(s) of an expression to execute. Essential properties of 
a strategy, such as to succeed whenever possible, will be recalled in Sec. [5] 



4.4 Transformations 

As described in Section 14.31 a computation is a sequence of expressions such that each 
expression of the sequence, except the first one, is obtained from the preceding expression 
by either of two transformations. One transformation is an ordinary redex replacement. We 
defer to (Echa hed and Janodet 19981 Def. 23) the precise formulation of this transforma- 
tion and to the next section the handling of decorations by this transformation. The second 
transformation is defined below. 

Definition 2 (Pull-tab) Let g be an expression, n a node of g, referred to as the target, not 
labeled by the choice symbol and Si . . . Sk the successors of n in g. Let i be an index in 
{1, . . . fc} such that referred to as the source, is labeled by the choice symbol and let t\ 
and ti be the successors of si in g. Let gj, for j = 1 , 2, be the graph whose root is afresh 
node rij with the same label as n and successors s\ . . . Si-\tjSi+i . . . Sk- Let g' = <?i?<?2- 
The pull-tab of g with source s, and target n is g[n «— g'] and we write g^g[n «— g']. 
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Fig. |3] depicts the result of a pull-tab step. For a trivial textual example, ((0 + 2) ? (1 + 
2)) * 3 is the pull-tab of ((0 ? 1) + 2) * 3. The definition excludes targets labeled by the 
choice symbol. These targets are not a problem for the pull-tab transformation, but would 
complicate, without any benefit, our treatment. 

A pull-tab step is conceptually very similar to an ordinary step — in a graph a (sub)graph 
is replaced. The difference with respect to a rewrite step is that the replacement is not 
constructed by a rewrite rule, but according to Def. [2] It seems very natural for pull-tab 
steps too to call redex the (sub)graph being replaced. 

Term and graph rewriting are similar formalisms that for many problems are able to 
model the expressions manipulated by functional logic programs. Not surprisingly, expres- 
sions are terms in term rewriting and graphs in graph rewriting. A significant difference 
between these formalisms is the identification of a subexpression of an expression. Term 
rewriting uses positions, i.e., paths in a tree, whereas graph rewriting uses nodes. Nodes 
are used for defining both rewrite rules and expressions to evaluate. Nodes are "placed in 
service" (1) to define rewrite rules, (2) when an expression, called top-level, is defined or 
created for the purpose of a computation, and (3) to define or construct the replacement 
used in a step. We agree that any node is placed in service only once, i.e., the same node is 
never allocated to distinct top-level expressions and/or replacements. However, the same 
node may be found in distinct graphs related by a step, since a step makes a localized 
change in a graph. These stipulations are formalized by the following principle, which is a 
consequence of placing nodes in service only once. 

Principle 1 (Persistence) Let g\ and gi be graphs. Ifn is a node in M 9l D N 92 , then there 
exists a graph g such that g =» g\ and g a> 92- 

4.5 Decorations 

To support pull-tabbing and ensure its correctness we attach additional information to an 
expression. This additional information is formalized as a decoration of a node similar to 
other decorations present in graph, e.g., label and successors. In this section, we rigorously 
define the function that maps nodes to choice identifiers. 

Definition 3 (Decorations) Let A : go e> g\ e> . . . be a computation. We define the id gi 
mapping, for each element gi of A, by induction on i, as follows: id 9i takes a node of gi 
labeled by the choice symbol and produces the node 's choice identifier. Base case: i = 0. 
id go (n), where n is in go and is labeled by the choice symbol, is an arbitrary element 
of D,, provided that id go is one-to-one. Ind. case: i > 0. By the induction hypothesis, 
id gi _ ± is defined for any choice node. In the step gi-\ e> gi, whether rewrite or pull-tab, a 
subexpression of gi-\ rooted by a node p is replaced by an expression rooted by a node q. 
Let n be a node of g^ 

1. If n is a node of gi-\ labeled by the choice symbol, then id gi (n) = id gi _ 1 {n). 

2. Otherwise, if gi-\ — > gi (an ordinary rewrite) and n is labeled by the choice symbol, then 
id gi (n) = a, for an arbitrary a £ 51 provided that id gj (m) ^ a for all j < i and all 
m G gj and id Bi (n) ^ id gi (m) for n ^ m (i.e., a is fresh). 
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3. Otherwise, if gi-\ = gi (a pull-tab) and n = q, then id gi (n) = id gi _ 1 (m), where m is the 
source node of the pull-tab. 

The above definition is articulated, but conceptually simple. Below, we give an informal 
account of it. In a typical step g -s» g 1 , most nodes of g end up in g' . The choice identi- 
fier, for choices, of these nodes remains the same. In a rewrite, some nodes are created. 
Any choice node created in the step gets a fresh choice identifier. In a pull-tab, informally 
speaking, the source (a choice) "moves" and the target (not a choice) "splits." The choice 
identifier "moves" with its source. Split nodes have no choice identifier. 

Each node in the "universe" of nodes M may belong to several graphs. In (Echahed andJanodet 19971 ). 
and accordingly in our extension (see Defs.Q]and[3]), the function mapping a node to a dec- 
oration depends on each graph to which the node belongs. It turns out that some decorations 
of a node, e.g., the label, are immutable, i.e., the function mapping a node to such decora- 
tions does not depend on any graph. We prove the immutability claim for our extension, 
the choice identifier. Obviously, there is no notion of time when one discusses expressions 
and considers the decorations of a node. Hence immutable decorations "are set" with the 
nodes. In practice, these decorations "become known" when a node is "placed in service" 
for the purpose of a computation or is created by a step. 

Lemma 1 (Immutability) Let g\ and gi be expressions. Ifn is a node in Af gi H M g2 , then 
id gi (n) = id g2 (n). 

Proof If a node n belongs to J\f gi PI JV g2 , then, by Principle[U there exists an expression 
g and computations A\ : g «> g 1 and A2 : g .92- By induction on the length of A\, 
resp. A2, using point 1 of Def. [3] id gi {n) = id g (n), resp. id g2 (n) = id g (n). The claim 
follows by transitivity. □ 

In view of this result, we drop the subscript from id since this practice simplifies the nota- 
tion and attests a fundamental invariant. 

Pull-tab steps may produce an expression with distinct choices with the same choice 
identifier. The same identifier tells us that to some extent these redexes are the "same". 
Therefore, when a computation replaces one such redex with the left, resp. right, alter- 
native, every other "same" redex should be replaced with the left, resp. right, alternative, 
too. If this does not happen, the computation is unacceptable. The notion of consistency of 
computations introduced next abstracts this idea. 

Definition 4 (Consistency) A rewrite step that replaces a redex rooted by a node n labeled 
by the choice symbol is called a choice step. A computation A is consistent iff for all a £ CI, 
there exists an i (either 1 or 2) such that every choice step of A at a node identified by a 
applies rule C'i of " ? " defined in (0. 

5 Correctness 

A strategy determines which step(s) of an expression to execute. A strategy is usually de- 
fined as a function that takes an expression e and returns a set S of steps of this expression 
or, equivalently, the reducts of e according to the steps of S. We will not define any specific 
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strategy. A major contribution of our work is showing that the correctness of pull-tabbing 
is strategy-independent. 

The classic definition of correctness of a strategy S is stated as the ability to produce 
for any expression e (in the domain of the strategy) all and only the results that would be 
produced by rewriting e. "All and only" leads to the following notions. 

Soundness: if e v is a computation of e in which each step is according to S and v is a 
value (constructor normal form), then e — > v. 

Completeness: if e — > v, where v is a value (constructor normal form), then there exists a 
computation e -s> u in which each step is according to 5. 

In the definitions of soundness and completeness proposed above, the same expression is 
evaluated both according to S and by rewriting. This is adequate with some conventions. 
Rewriting is not concerned with choice identifiers. This decoration can be simply ignored 
in rewriting computations. In particular, in rewriting (as opposed to rewriting and pull- 
tabbing) a computation is always consistent. In graph rewriting, equality of graphs is mod- 
ulo a renaming of nodes. A precise definition of this concept is in (Echahe d and Janodet 19971 
Sect. 2.5). 

Typically, the proof of soundness is trivial for strategies that execute only rewrite steps, 
but our strategy executes also pull-tab steps, hence it creates expressions that cannot be 
produced by rewriting. Indeed, some of these expressions will have to be discarded to 
ensure the soundness. The proof of correctness of pull-tabbing is non-trivial and relies on 
two additional concepts, representation and invariance, which are presented in following 
sections. 



5.7 Parallel Moves 



Proofs of properties of a computation are often ac- 
complished by "rearranging" the computation's steps 
in some convenient order. A fundamental result 
in rewriting, known as the Parallel Moves Lemma 
(Huet and Levy 1991 1, shows that in orthogonal sys- 



tems the steps of a computation can be rearranged at 
will. A slightly weaker form of this result carries over 
to LOIS systems. A pictorial representation of this re- 
sult is provided in Fig. 4. The symbol denotes the 
reflexive closure of the rewrite relation. The notation 
"^n r ", where n is a node and r is a rule, denotes ei- 
ther equality or a rewrite step at node n with rule r. 




Fig. 4. The Parallel Moves 
Lemma for LOIS graph rewrit- 
ing systems under appropriate 
conditions on nodes and rules. 



Lemma 2 {LOIS parallel moves) Let e, e\ and e 2 be expressions such that e 



1 n\,ri 



and ri 



e% where for i = 1,2, rij is a node and ri is a rule. If n\ ^ n 2 or both n\ = n 2 
then there exists an expression e! such that {modulo a renaming of nodes) 



r-2 



rj- e 2 . 



Proof By cases on the assumption's condition. When both ri\ = ri2 and r\ = r%, the 
two steps are the same, hence e\ = e! = e 2 . When n\ ^ n 2 : the claim is a restriction of 
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5.2 Representation 

A characteristic of pull-tabbing, similar to bubbling and copying, is that the completeness 
of computations is obtained by avoiding or delaying a commitment to either alternative of 
a choice. In pull-tabbing, similar to bubbling, both the alternatives of a choice are kept or 
"represented" in a single expression throughout a good part of a computation. The proof 
of the correctness of pull-tabbing is obtained by reasoning about this concept, which we 
formalize below. 



Definition 5 (Representation) We define a mapping R that takes an expression g and 
returns a set R g called the represented set of g as follows. Let g be an expression. An 
expression e is in R g iff there exists a consistent computation g A e (modulo a renaming 
of nodes) that makes all and only the choice steps of g. 

In other words, we select either alternative for every choice of an expression. For choices 
with the same identifier, we select the same alternative. Since distinct choice steps occur 
at distinct nodes, by Lemma|2]the order in which the choice steps are executed to produce 
any member of the represented set is irrelevant. Therefore, the notion of represented set 
is well defined. The notion of represented set of g is a simple syntactic abstraction not to 



be confused with the notion of set of values of an expression g ( Antoy and Hanus 2009 1, 
which is a semantic abstraction fairly more complicated. 



5.3 Invariance 

The proof of correctness of pull-tabbing is based on two results that informally speaking 
establish that the notion of represented set is invariant both by pull-tab steps and by non- 
choice steps. 

Lemma 3 (Invariance by pull-tab) If g a g' is a pull-tab step, then (1) for any expression 
e £ R g , there exists an expression e' £ R g i such that e = e' (modulo a renaming of nodes), 
and (2) for any expression e' £ R g i, there exists an expression e £ R g such that e = e' 
(modulo a renaming of nodes). 

Proof We set up the notation for both claims. If g a g' is a pull-tab step, then by Def.|2l 

g = C[n f :f(s 1 ,... 1 n:7(n 1 :x,n 2 :y),...s k )} 

g' = C[n':?(n fl :f(s 1 ,...n 1 :x,...Sk),n f2 :f(s 1 ,...n 2 :y,...Sk))} 

where C is some context; rif, n, m, n<x, n! , nf 1 and nj 2 are nodes; / ^ ?; s\, . . . Sfc and 
x and y are expressions; n is the i-th successor of nf, ri\, resp. n 2 , is the z-th successor of 
n f 1 , resp. n f 2 . 

Claim (1): let A : g A e be a computation witnessing that e £ R g , i.e., a consistent com- 
putation making all and only the choice steps of g. From this computation we construct a 
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computation A' of g' that produces an expression e' satisfying the claim. Without loss of 
generality, since the notion of represented set is well defined, we assume that the first step 
of A is g — >n,o, h, for some h, where Cj is either C\ or C2 of (0. Let h and ft/ be expres- 
sions defined by the following computation: g' -^ n ',Cj h c ^ ■ Node n may or may 
not be in g' . In particular, n is in g' iff n has more than one predecessor in g. If n G Af g >, 
then n G Af^; otherwise, the step h -? n c . h' does not replace any subexpression of h! and 
h' = ft. We explicitly construct a homomorphism p : Afh — > Afh' that shows that ft = ft/ 
modulo a renaming of nodes. We show that (a) for each node m G Afh, with m 7^ n/,, 
m G A/ft/, and vice versa, (b) for each node m G Afh', with m 7^ n^, m G A/^. To prove 
(a), let m 7^ nf be a node of /i. Since g —^n,Ci ft is a choice step, m is either in the context 
C of g or in the subexpression rooted by rij . These portions of g are preserved by the steps 
that produce g 1 , ft and ft'. Thus, m G Afh'- The proof of (b) is analogous. Therefore, we 
define p{nj) = Uf j and p(m) = m, if m ^ nf. By (a) and (b) and by construction, p is a 
bijection. By construction, p preserves root, label, and successors of every node. Thus the 
computation A 1 starts with g —> h, followed byh^h'ifh^h'. Then, for any step of A 
starting with expression h at node p with rule r there is a step of A' starting with expres- 
sion h' at node p(p) with rule r. These computations start at equal expressions (modulo a 
renaming of nodes) and make the same steps, hence they end at equal expressions (modulo 
a renaming of nodes). Since A is consistent, so is A'. Let e' be the last expression of A'. 
This proves that e' G R g ' . 

Claim (2): let A' : g' A e' be a computation witnessing that e' G R g >, i.e., a consistent 
computation making all and only the choice steps of g 1 . From this computation we con- 
struct a computation A of g that produces an expression e' satisfying the claim. Without 
loss of generality, since the notion of represented set is well defined, we assume that A' 
begins with the steps g' -^ n ',Cj h -? n c ^ ■ The m l e must be the same in both steps 
because, if n G Af^, then id(n) = id(n r ) and A' is consistent. Node n may or may not be 
in g' . In particular, n is in g' iff n has more than one predecessor in g. If n G Af g >, then 
?i G Af^; otherwise, the step h -? n c ft/ does not replace any subexpression of h! and 
ft/ = h. We define the first step of A as 5 — > n ,Cj h. The rest of the proof is substantially 
equal to that of Claim (1). We complete A with the same steps of A' past h' and obtain an 
expression e in R g . We show in exactly the same way that, modulo a renaming of nodes, 
ft = ft' and consequently e = e'. □ 



Lemma 4 (Invariance by non-choice) 7/\g — > g' is a rewrite non-choice step, then (1) for 
any expression e G R g , there exists an expression e' G R g ' such that e A e! {modulo a 
renaming of nodes), and (2) for any expression e' G R g >, there exists an expression e G R g 
such that e A e! (modulo a renaming of nodes). 

Proof Claim (1): let A ; g = go — > g\ — > . . . g n = e be a computation witnessing that 
e G R g , i.e., a consistent computation making all and only the choice steps of g. From A, 
we construct a computation A' of g' that produces an expression e' satisfying the claim. 
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Consider the following diagram, where the top row is A and the bottom row is A': 

9 = 90 ->- 9i —> 9n = e 

I |= 1 = 

9 = 9o 9i -> 9 n ~* e 

By induction on i, for i = 1, ... n, we both define g' i _ 1 — > g\ and prove that the diagram 
commutes. To support the induction, we strengthen the statement to include the definition 
of the step gi -4 g[ and the condition that this step is not a choice step. Base case, i = 1: 
The steps g go — ► 3i are given by the assumptions. Since the first is not a choice step 
and the second is a choice step, they are at distinct nodes. Hence, Lemma[2]gives the steps 
9o ~? 9i ^~ 9i an d me commutativity of the diagram. The step g\ -4 g[ is not a choice 
step because either g\ = g[ or it is at the same node as go — > g' Q . Ind. case, i > 1: The 
steps <— — > gi are given by the induction hypothesis and assumption, respec- 
tively. Since one is a choice step and the other is not, they are at distinct nodes. Hence, 
Lemma|2]gives the steps g' i _ 1 -? g[ %- gi and the commutativity of the diagram. The step 
gi g'i is not a choice step because either = g^or it is at the same node as i g' i _ 1 . 
Since A is consistent, A' up to g' n is consistent as well. We reduce any remaining choice 
of g' n consistently with the preceding steps of A', say g' n A e', to produce an expression 
e' 6 R g > . Thus, by the commutativity of the diagram e = g n -? g' n A e' witnesses the 
claim. 

Claim (2): let B : g' A e' be a computation witnessing that e' S i? g ' , i.e., a consistent com- 
putation making all and only the choice steps of g'. From this computation we construct a 
computation of g that produces an expression e satisfying the claim. Let Q = Af g nAf g > , i.e., 
be the set of nodes both in g' and g. Suppose that the cardinality of Q is n, for some n 0. 
We reorder the steps of B, which is possible by Lemma|2] so that any step at some node of 
Q occurs before any step at some node not in Q. Let A' : g' = g' — > g[ — y . . . g' n A e' be 
one such computation. From A', we construct a computation A that produces an expres- 
sion e satisfying the claim. Consider the following diagram, where the top row is A' and 
the bottom row is A: 

tit i*i 
9 = 9o 9i 9 n "Z" e' 

t 1 1\ 

.9 = .9o ^ 9i —) g n -Hs- e 

By induction on i, for i = 1, ... n, we both define — > and prove that the diagram 
commutes. To support the induction, we strengthen the statement to include the definition 
of the step gi g[ and the condition that this step is not a choice step. Base case, i = 1: 
Let go be the root of the redex of g' Q —>q ,r 9\- By assumption, go S Q. Hence g £ A/" So . 
We let go — > qQ , ra 9i- Thus we have the steps g' go — > g\ where by assumption the 
first is not a choice step and by construction the second is a choice step. Since these steps 
are at distinct nodes, by Lemma |2] there exists some g" such that g' -^ qa ra g" ~<— g\. 
Therefore, g" = g[ and the diagram commutes. The step g\ -5 g[ is not a choice step 
because either g\ = g[ or it is at the same node as go — > g Q . Ind. case, i > 1: Let qi-\ 
be the root of the redex of g' i _ 1 — > qi _ ltn _ 1 g[. By assumption, g;_i S Q, hence in g. 
Thus, node gj_i in g\_ x is not created by the step g.j_i -4 g^i- Consequently, gj_i is a 



726 



S. An toy 



node of <7i_i too. We let <7j_i — >- gi _ 1 ri _j gi. Thus we have the steps g.;_i — > 9, 

where by assumption the first is not a choice step and by construction the second is a 
choice step. Since these steps are at distinct nodes, by Lemma|2]there exists some g" such 
that g[_ l -5qi-x rt-i d" 9i- Therefore, g" = g\ and the diagram commutes. The step 
fji g'i is not a choice step because either gi = g[ or it is at the same node as </j_i — > g' i _ 1 . 
Since A' is consistent, A up to g n is consistent as well, since corresponding steps use the 
same rule. We reduce any remaining choice of g n consistently with the preceding steps of 
A, say g„ A e, to produce an expression e G R g . We show that e -4 g' n . If e ^ g' n , then 
there exists a choice step g n — > g n +i in A. By construction, this step is at some node q„ 
which is not in Q and hence is not in g' n . This means that the step g n — > g' n erases node q n . 
Thus, we have the steps g' n 1— g n — > 9 „,r ffn+i, for some rule r, where by construction the 
first is not a choice step and by assumption the second is a choice step. Since these steps 
are at distinct nodes, by Lemma |2] there exists some g" such that g' n -^ qn r g" 1— 9n+i- 
Since q n g" Ng' , 9" = g' n ar, d 9n+i g'„- The same above reasoning proves that for 
any expression h of g n A e in A, h -5 g' n . In particular, e -5 g' n . Thus, e -5 g' n A e' 
witnesses the claim. □ 

We combine the previous lemmas into computations of any length. 

Corollary 1 If g a> g' with no choice steps, then (1) for any expression e G R g , there 
exists an expression e' G R g i such that e — > e' (modulo a renaming of nodes), and (2) for 
any expression e' G f/zere exists an expression e G R g such that e A e' (modulo a 
renaming of nodes). 

Proof Both claims are proved by a trivial induction on the number of steps of g A g' 
using Lemmas [3] and |4] □ 



Theorem 1 (Correctness) If ' g A g' with no choice steps, then (1) for any value v such 
that g — > v is a consistent computation, there exists a value v' such that g' A v' is a 
consistent computation, and v — v' (modulo a renaming of nodes), and (2) for any value 
v' such that g' A v' is a consistent computation, there exists a value v such that g A v is 
a consistent computation, and v — v' (modulo a renaming of nodes). 

Proof Claim (1): let A : g A v a consistent computation of g into a value v. By Lemma|2] 
without loss of generality we assume that A : g A e A v, where the segment g A e 
consists of all the choice steps of g. Since A is consistent, e G R g . By Corollary [TJ there 
exists a consistent computation g' A e! such that e = e! (modulo a renaming of nodes). 
Since e = e' (modulo a renaming of nodes) and e A v, there exists a computation e! A u' 
such that v = v' (modulo a renaming of nodes). 

Clam (2): the proof is analogous to that of claim (1). □ 

TheoremQ] suggests to apply both non-choice and pull-tab steps to an expression. Choices 
pulled up to the root are reduced consistently and without context cloning. Of course, by the 
time a choice is reduced, all its spines have been cloned — similar to bubbling and copying. 
A better option, available to pull-tabbing only, is discussed in the next section. 
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6 Application 

The pull-tab transformation is meant to be used in conjunction with some evaluation strat- 
egy. We showed that pull-tabbing is not tied to any particular strategy. However, the strat- 
egy should be pull-tab-aware in that: (1) a choice should be evaluated (to a head normal 



form) only when it is needed ( Antoy 1997 1, (2) a choice in a root position is reduced (con- 
sistently), whereas in a non-root position is pulled, and (3) before pulling a choice, one 
of the choice's alternatives should be a head-normal form. The formalization of such a 
strategy would take us well beyond the scope of this paper. 

In well-designed, non-deterministic programs, either or both alternatives of most (but 



not all) choices should fail (Antoy 2010 1. Under the assumption that a choice is evaluated 



to a head normal form only when it is needed (Antoy 1997 1, if an alternative of the choice 
fails, the choice is no longer non-deterministic — the failing alternative cannot produce a 
value. Thus, the choice can be reduced to the other alternative without loss of completeness 
and without context cloning. This is where pull-tabbing is advantageous over copying and 
bubbling — any portion of a choice's context not yet cloned when an alternative fails no 
longer needs to be cloned. Of course, the implementation must still identify the choice, 
and choice's single remaining strand as either left or right, to ensure consistency. 



7 Related Work 

We investigated pull-tabbing, an approach to non-deterministic computations in functional 
logic programming. Section[5]recalls copying and bubbling, the competitors of pull-tabbing. 
Here, we briefly highlight the key differences between these approaches. Pull-tabbing en- 
sures the completeness of computations in the sense that no alternative of a choice is left 
behind until all the results of some other alternative have been produced. Similar to every 
approach with this property, it must clone portions of the context of a choice. In contrast 
to copying and bubbling, it clones the context of a choice in minimal increments with the 
intent and the possibility of stopping cloning the context as soon as an alternative of the 
choice fails. 

The idea of identifying choices to avoid combining in some expression the left and right 
alternatives of the same choice appears in (Brasse Fand Huch 20071) . The idea is developed 
in the framework of a natural semantics for the translation of (flat) Curry programs into 
Haskell. A proof of the correctness of this idea will appear in (Bra ssel 201 II) which also 
addresses the similarities between the natural semantics and graph rewriting. This discus- 
sion, although informal, is enlightening. 



8 Conclusion 

We formally defined the pull-tab transformation, characterized the class of programs for 
which the transformation is intended, extended the computations in these programs to in- 
clude the transformation, proved the correctness of these extended computations, and de- 
scribed the condition that reduces context cloning. In contrast to its competitors, in pull- 
tabbing any step is a simple and localized graph transformation. This fact should ease exe- 
cuting the steps in parallel. Future work, aims at defining a pull-tab-aware parallel strategy 
and implementing it to measure the effectiveness of pull-tabbing. 
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